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(57) Abstract 

The present invention relates to a method to identify regions of chromosomes that interact functionally within or between chromosomes 
which comprises the steps of: a) recombining a linear recipient vector with a target sequence present in at least one chromosome, wherein 
the recipient vector comprises at least a recipient sequence consisting of a nucleic acid sequence complementary to recombine with the 
target sequence and a tail consisting of a heterologous nucleic acid sequence; and b) identifying at least one region of the chromosome 
capable of recombining with the recipient vector by PCR means or by detecting a label and/or a tag on the recipient vector or the target 
sequence. 
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A METHOD TO MAP AND ISOLATE REGIONS OF CHROMOSOMES THAT 
INTERACT OR ASSOCIATE FUNCTIONALLY WITHIN OR BETWEEN 

CHROMOSOMES IN VIVO 

5 BACKGROUND OF THE INVENTION 

( a) Field of the Invention 

The invention relates to a method for mapping 
and isolating regions of chromosomes that interact 
together or associate functionally in vivo. The method 
10 of the present invention is also referred to as 
recombination access mapping (RAM). 

(b ) Description of Prior Art 

Currently in the field of molecular biology, it 
is becoming quite evident that gene regulation occurs 
15 through a complex network of processes. Transcription, 
replication and recombination of DNA must occur in a 
timely and appropriate manner or the outcome may be 
disastrous. Extreme control over these processes is 
required during development and differentiation of 

2 0 tissues in multicellular organisms. In contrast, 

disorder of these processes occurs during oncogenesis . 
Cancer cells often exhibit aberrant expression of genes 
as well as general genomic instability, a hallmark of 
which is an increase in recombination rates. 

25 Activation or repression of gene expression by 

transcription factors or repressors respectively has 
been studied in great detail both in vitro and in vivo. 
Changes in chromatin structure are intimately linked to 
the activation or inactivation of a gene and can affect 

30 the replication or recombination of DNA. Mostly in 
vitro studies and limited in vivo data has been used to 
determine such changes in chromatin as they relate to 
DNA transcription, replication and recombination. From 
this data it is apparent that chromatin is organized in 

3 5 DNA loop domains that are organized through interaction 

with a nuclear protein matrix. These domains average 
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in size from 60*100 kb and are assumed to be flanked by 
matrix attachment regions (MARS). MARS have been 
associated with functional domains of transcription and 
replication. Indirect methods such as Dnasel 
5 sensitivity and DNA cleavage assays involving 
topoisomerase inhibitors, have provided further 
evidence of higher order chromatin domains which may 
correspond to single loops (60-100kb) and loop arrays 
(300kb). Functional chromatin domains seem to exist 

10 for transcription and replication (reviewed by in 
Jackson, D. A., Bioessays 17:587-591, 1995). Strong 
evidence exists which suggests chromatin structure also 
plays a role in recombination as in VDJ recombination, 
formation and repair of double strand breaks in 

15 irradiated cells, and during meiosis; differences in 
meiotic recombination between imprinted domains. 

Therefore, the central question is raised as to 
the interaction * between chromatin domains and their 
accessibility to biological molecules are involved in 

20 gene regulation. Many studies of chromatin focus on in 
vitro data at the nucleosomal level (Wolf fe, A. 
Chromatin: Structure and Function, Second Ed. Academic 
Press Inc. San Diego 1995). Unfortunately, it is most 
likely that large scale changes in chromatin packaging 

25 beyond the nucleosomal level are primarily responsible 
for the maintenance of certain chromatin states, such 
as early or late replication and hetero vs. 
euchromatin. Very few techniques exist to analyze DNA 
interaction in vivo in a global fashion over the entire 

30 genome. 

Ectopic gene targeting is an alternative outcome 
of the gene targeting process in which a targeting 
vector acquires sequences from a genomic target but 
proceeds to integrate elsewhere in the genome. More 
35 specifically, ectopic gene targeting is a process by 
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which an extra chromosomal molecule (recipient) obtains 
DNA sequence from a target locus via one-end invasion 
and gene conversion followed by release of the 
recipient molecule and integration, complete with the 
5 newly acquired sequence from the target locus, 
elsewhere in the genome. Such events were first 
observed in gene targeting experiments involving the 
adenine phosphoribosyl transferase (APRT) locus in CHO 
cells (Adair, G. M., et al., Proc . Natl. Acad. Sci . USA 

10 86:4574-4578, 1989) and in experiments involving 
retroviral transfection of rat cells (Ellis, J. et al . , 
Mol. Cell. Biol. 9:1621-1627, 1989). Consequently, a 
model has been proposed for the mechanism of ectopic 
gene targeting (Belmaaza, A., et al. , Nucl . Acids Res. 

15 18:6385-6391, 1990; and Belmaaza, A. et al., Mut . Res. 
314:199-208, 1994). Instances of ectopic gene 

targeting and/or ectopic gene conversion have been seen 
in Drosophila ( roo element, p and hobo elements), 
plants, yeast (between dispersed repeated genes or Ty 1 

20 repeat elements), fungi (in Ustilago maydis, chickens 
(Ig rearrangement)), rabbit (generation of antibody 
repertoire), mice (germline ectopic gene conversion in 
spermatids, gene conversion between Line-1 elements, 
and humans (gene conversion between Line-1 elements and 

25 pseudo autosomal region on X and Y chromosome). 

Although the phenomenon of ectopic gene 
targeting is well documented, the question of where the 
recipient molecule integrates, with respect to the 
target locus, has not been determined. It is apparent 

3 0 from Southern analysis that the recipient integrates in 
most cases at a distinct site from the target but 
Southern analysis does not permit the determination of 
the relative position of the ectopic sites with regard 
to the target locus. 
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It would be highly desirable to be provided with 
a method allowing the identification of interactions 
between chromatin domains within or between chromosomes 
which may be involved in gene regulation. With such a 
5 method, the functional organization of the genome could 
be mapped. The understanding the three-dimensional (3- 
D) in vivo interactions between chromosome could help 
to better understand complex gene regulation during 
cancer or other disease states. 

10 

SUMMARY OF THE INVENTION 

One aim of the present invention is to provide a 
method allowing to define functional organization of 
chromatin in vivo with respect to interchromosomal and 

15 interchromatin domain interactions involved in gene 
regulation, including but not limited to replication, 
transcription and recombination. 

Another aim of the present invention is to 
provide a method allowing to mark domains of chromatin 

20 that interact functionally in vivo with a given gene 
locus for the purpose of cloning such domains or their 
visualization in 3-D using confocal fluorescent 
microscopy. 

Another aim of the present invention is to 
25 provide a method allowing to define points of 
interaction between chromosomes involved in 

translocation or ectopic gene conversion within or 
between chromosomes . 

Another aim of the present invention is to 
3 0 provide a method allowing to define chromatin domain 
interactions between chromosomes involved in epigenetic 
phenomenon such as imprinting, position effect 
variegation and transvection . 

Another aim of the present invention is to 
3 5 provide a method allowing to produce diagnostic ectopic 
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gene targeting distribution profiles, such as 
fingerprints, for a given gene locus. 

Another aim of the present invention is to 
provide a method allowing to determine changes in 
5 genomic organization associated with various disease 
states as a means of monitoring disease progression or 
onset. 

Another aim of the present invention is to 
provide a method allowing to study developmental 

10 changes in multicellular organisms such as during 
tissue development . 

Another aim of the present invention is to 
provide a method allowing for the placement of DNA 
elements or recognition sites for enzymes for the 

15 purpose of chromosomal engineering. 

Another aim of the present invention is to 
provide a means for mapping the distribution of double 
strand breaks in DNA, which are natural breaks or 
induced by any means, over a given region of a 

2 0 chromosome with respect to a chromosomal DNA sequence 
to be studied, which in turn allows for the definition, 
characterization and cloning of structural and/or 
functional genomic domain ( s ) containing the chromosomal 
sequence being studied. 

25 A further aim of the present invention is to 

provide a method allowing to assess the affects of a 
given drug or chemical on genomic organization and 
stability such as for defining oncogenic potential of a 
substance . 

30 A still further aim of the present invention is 

to provide a method allowing to define at what time in 
a cell cycle chromatin domains associate functionally. 

In accordance with the present invention there 
is provided a method hereinafter refered to as 

35 recombination access mapping (RAM), that enables the 
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elucidation of DNA interaction between domains of 
chromatin within the genome, In vivo. 

In accordance with the present invention there 
is provided a method to identify regions of chromosomes 
5 that interact functionally within or between 
chromosomes which comprises the steps of : 

a) recombining a linear recipient vector with a 
target sequence present in at least one 
chromosome, wherein the recipient vector 
10 comprises at least a recipient sequence 

consisting of a nucleic acid sequence 
(~ complementary to recombine with the target 

sequence and a tail consisting of a heterologous 
nucleic acid sequence; and 
15 b) identifying at least one region of the 

chromosome capable of recombining with the 
recipient vector by PCR means or by detecting a 
label and/or a tag on the recipient vector or 
the target sequence. 
2 0 In accordance with another embodiment of the 

present invention, there is provided a method which 
further comprises the step of: 

c) characterizing the region identified in step b), 
thereby mapping the region on a chromosome or a 

2 5 segment thereof. 

The "target sequence" may be a modified vector 
comprising a genomic sequence and a reporter gene or 
fragment thereof, which allows for the identification 
of the target sequence. 

3 0 The "genomic sequence" may be a gene or fragment 

thereof. 

The "target sequence" may be modified to 
introduce a first part of a "reporter gene", and 
wherein the recipient vector comprises a second part of 
35 the reporter gene, such that recombination of the 
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recipient vector with the target sequence allows the 
reporter gene to be functional, thereby allowing for 
the identification of the target sequence. 

The "recipient vector" may further include at 
5 least one fluorescence in situ hybridization signal 
(FISH) or one radioactive in situ hybridization signal 
allowing for detection of in situ, hybridization of the 
recipient vector with the target sequence. 

The "recipient vector" may also include a signal 
10 in situ hybridization for detection by electron 
microscope in situ hybridization of the recipient 
vector with the target sequence. 

The "tail" may have a sequence of at least 1 Kb 
in length, preferably of about 10 Kb in length. 
15 The "recipient nucleic acid sequence" may be of 

at least 300 bp in length, preferably of at least 500 
bp in length and more preferably of about 700 bp in 
length. 

The "reporter gene" may be a selection gene. 
20 The "selection gene" includes, without 

limitation, neomycin, puromycin, hygromycin and herpes 
simplex thymidine kinase. 

In accordance with another embodiment of the 
present invention there is provided a method to 
25 identify regions of chromosomes that interact 
functionally within or between chromosomes which 
comprises the steps of : 

a) recombining a linear recipient vector with a 
target sequence present in at least one 
30 chromosome, wherein the recipient vector 

comprises at least a recipient sequence 
consisting of a nucleic acid sequence 
complementary to recombine with the target 
sequence; 
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b) identifying at least one region of the 
chromosome capable of recombining with the 
recipient vector by inverse polymerase chain 
reaction (PCR); and 
5 c) characterizing the region identified in step b), 

thereby mapping the region on a chromosome or a 
segment thereof. 

In accordance with the present invention there 
is also provided a method to identify regions of 
10 chromosomes that interact functionally within or 
between chromosomes which comprises the steps of: 

a) recombining a linear recipient vector with a 
genomic sequence, wherein the linear recipient 
vector comprises a recipient sequence of about 

15 700 bp in length and a tail comprising a 

heterologous nucleic acid sequence of about 
10 Kb in length and at least one fluorescence in 
situ hybridization signal (FISH) allowing for 
detection of in situ hybridization of the linear 

20 recipient vector with the genomic sequence, the 

genomic sequence comprises a sequence of a gene 
present in at least one chromosome, the 
recipient sequence is a nucleic acid sequence 
complementary to recombine with the sequence of 

25 the gene and the genomic sequence is modified to 

introduce a first part of a neomycin gene, the 
recipient vector comprises a second part of the 
neomycin gene, such that recombination of the 
recipient vector with the genomic sequence 

3 0 allows the neomycin gene to be functional, 

thereby allowing for the identification of the 
genomic sequence? 

b) identifying a region of the chromosome to which 
the recipient vector is recombined by selecting 

35 surviving cells in neoraycin-containing medium, 



WO 98/55610 ^^CT/CA98/00532 



thereby indicating cells containing recombined 
recipient vector with the genomic sequence 
allowing for the expression of the neomycin 
gene, and detecting at least one fluorescence in 
5 situ hybridization signal; 

c) characterizing the region identified in step b), 
thereby mapping the region on a chromosome or a 
segment thereof. 

In accordance with the present invention there 
10 is also provided a method to map the distribution of 
double strand breaks in a chromosomal DNA sequence in 
order to define structurally or functionally genomic 
domains. The method comprises the steps of: 

a) recombining a linear recipient vector with a 
15 chromosomal target sequence present in at least 

one chromosome, wherein the recipient vector 
comprises at least a recipient sequence 
consisting of a nucleic acid sequence 
complementary to the target sequence and a 

20 specific DNA sequence for polymerase chain 

reaction ( PCR ) amplification or for fluorescent 
in situ hybridization (FISH) analysis and 
wherein at least the recipient sequence or the 
target sequence contains a recognition site for 

25 an enzyme or chemical means for inducing a 

unique and specific double strand break within 
the recipient or target sequences; 

b) identifying at least one region of the 
chromosome recombining with the recipient vector 

30 by inverse PCR or plasmid rescue; 

c) characterizing the size and position of a 
genomic domain containing the chromosomal target 
and recipient sequences by at least FISH 
analysis or pulse field gel electrophoresis, 

35 following the formation of specific double 
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strand breaks within the chromosomal recipient 
and target sequences by enzymatic or chemical 
means ; 

d) cloning at least in part the identified genomic 
5 domain of step c); and 

e) DNA sequence analysis of the genomic domain 
cloned in step d), thereby characterizing the 
genes and at least functional or structural 
sequence elements within the identified genomic 

10 domain. 

The method of the present invention as described 
above may preferably be used to characterize or 
identify functional or structural sequence elements 
such as origins of replication, matrix attachment 

15 sites, transcription factor binding sites, imprinting 
centers or insulator elements- Of course, the double 
strand breaks may be have been formed in vivo or in 
vitro. 

2 0 BRIEF DESCRIPTION OF THE DRAWINGS 

Figs. 1A and IB illustrate a prefered embodiment 
of RAM method in accordance with the present invention. 

Figs. 2A to 2D illustrate southern analysis of 
clones obtained and isolated with a RAM method in 
25 accordance with one embodiment of the present 
invention ; 

Fig. 3 illustrates the arrangement of the 
vectors sequences and their restriction enzyme sites 
used with a RAM method in accordance with one 
30 embodiment of the. present invention to obtained the 
clones of Fig. 2; 

Fig. 4 illustrates Fluorescent In Situ 
Hybridization (FISH) analysis of clones obtained and 
isolated with a RAM method in accordance with one 
35 embodiment of the present invention; 
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Fig- 5 illustrates a histogram of inter signal 
distances determined for target and recipient FISH 
signals in interphase nuclei; and 

Fig. 6 illustrates a "Double Strand Break (DSB) 
5 Proximity" model of a RAM method in accordance with one 
embodiment of the present invention. 

DETAILED DESCRIPTION OF THE INVENTION 

In accordance with the present invention, there 

10 is provided a Recombination Access Mapping (RAM) 
method. Preferably, two vectors which can recombine to 
produce a functional gene such as neo+ via a one-end 
invasion mechanism of recombination (Adair, G. M. , et 
al., Proc. Natl. Acad. Sci. USA 86:4574-4578, 1989; 

15 Ellis, J., et al., Mol . Cell. Biol. 9:1621-1627, 1989; 
Belmaaza, A., et al., Nucl. Acids Res. 18:63 85-6391, 
1990; and Belmaaza, A. et al., Mut. Res. 314:199-208, 
1994) are used in the method of the present invention. 
Cells are transfected with a "target" vector. Clones 

20 containing the "target" vector are then transfected 
with a "recipient" vector. Ectopic gene targeting 
events are selected for example by G418 resistance. 
The distribution of ectopic events in relation to the 
target locus may be determined using two-color 

25 fluorescent In situ hybridization (FISH). Integration 
of the recipient may occur in close proximity to the 
target locus (close events: less than 2-3 Mb from the 
target) or in other chromosomes other than the target 
chromosome (far events). The distribution of ectopic 

3 0 integration is distinct for each locus and provides a 
"fingerprint" of chromatin domain interactions for a 
particular gene which are distinct for a given tissue, 
developmental stage or disease state. Interphase FISH 
analysis of far ectopic gene targeting events indicates 

35 that the chromatin domains containing each respective 
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vector can associate even though the recipient has 
integrated in distinct chromosome from the target 
chromosome - 

Therefore, such domains may be identified and 
5 cloned. Cloned domains may contain important enhancers 
or silencers and/or other regulatory elements of gene 
expression. They may also contain origins of 

replication and/or regulatory DNA elements involved in 
the replication of DNA . These domains correspond to 

10 DNA elements which may inturn be used to find DNA 
binding proteins responsible for gene regulation and 
the functional structure of chromatin. 

Fluorescent in situ hybridization (FISH) 
analysis, which can be used to identify the genomic 

15 location of distinct DNA sequences with a general 
resolution of ~100 Kb at interphase and ~2-3 Mb at 
metaphase (Trask, B. J., Trend. Genet. 7:149-154, 1991), 
provides a unique tool for analysis of DNA sequences 
with respect to their chromosomal positions and with 

20 respect to each other. 

A method for studying ectopic gene targeting 
that uses two vectors, a "target" and a "recipient" 
vector which can recombine to produce a functional gene 
(neo+) via the one-end invasion mechanism of 

2 5 recombination has been developed. A murine fibroblast 
cell line (LTA) was transfected with the "target" 
vector. Three distinct clones containing the "target" 
vector integrated in their genome were then transfected 
with the "recipient" vector. Ectopic gene targeting 

30 events, which are characterized by the acquisition of 
sequences from the target by the recipient vector and 
then its integration in the genome, were selected 
preferably by G418 resistance. The distribution of 
ectopic gene targeting events in relation to the target 

35 locus was determined using two-color FISH. The results 
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indicate that the distribution of ectopic gene 
targeting events is bimodal . Ectopic integration of 
the recipient vector occurred either in close proximity 
to the target locus <<2-3 Mb) or in altogether 
5 different chromosomes from the target chromosome. In 
contrast / illegitimate integration showed no bias for 
any single chromosome or chromosomal location at 
megabase resolution. A corollary to these observations 
is that both inter and intrachromosomal DNA 

10 interactions appear to occur during ectopic gene 
targeting. Therefore, the method could be used to 
determine which chromosomal domains within the genome 
are accessible to a given genetic locus. 

The method of the present invention will be 

15 referred to as Recombination Access Mapping or RAM. In 
its most basic form, a vector comprising only minimal 
homology (about 300 bp) to a target locus in the 
genome, which may be a native DNA locus or exogenous 
sequences and whose location or identity may or may not 

20 be known, is utilized- This minimal recipient vector 
can undergo ectopic gene targeting at the target locus 
and upon integration, the position of the recipient 
vector may be mapped by inverse PCR or other oligo 
based technologies. Selection of ectopic gene 

2 5 targeting events would occur by PCR in which a band is 

only produced upon recombination of the target locus 
and the recipient vector. By this means, rapid large 
scale "RAM fishing" may be effected using minimal sized 
fragments of DNA to map in vivo, 3-D chromatin 

3 0 interactions over the entire genome. 

Fig. 1A illustrates two plasmid based vectors 
used in the RAM method in accordance with a prefered 
embodiment of the invention. The vectors are depicted 
in linear form after digestion via Not I prior to 
35 transf ection . The vector A1059A,tk (target) contains a 
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3' truncated neo cassette. Vector BllSAdhyg 

(recipient) contains a 5 1 truncated neo cassette. Both 
vectors contain selection markers for illegitimate 
integration (HSV-tk and Hyg, respectively) and specific 
5 DNA for detection via FISH analysis (lambda DNA and 
adenovirus-2 DNA, respectively). Fig. lb illustrates 
the mechanism of ectopic gene targeting leading to 
reconstruction of a functional neo gene between an 
integrated copy of the target vector and an extra 

10 chromosomal recipient molecule. A simplified version 
of the vectors from panel A are depicted for clarity. 
One-end invasion of the of the target locus by the 
homologous 3 1 end of the recipient molecule leads to 
the formation of a D-loop. The invading 3* end of the 

15 recipient primes DNA sythesis leading to gene 
conversion and extension of the D-loop. Resolution by 
nicking of the D loop results in non-crossover and 
crossover products, the later involving integration 
within the target locus (i). Alternately, resolution 

20 can occur via unwinding of the newly sythesized strand 
and release of the recipient molecule or its 
displacement due to branch migration. In the case of a 
non-crossover event, the recipient molecule may 
integrate illegitimately in a different locus leaving 

25 the target locus unchanged (ii). Integration of the 
recipient molecule after gene conversion, elsewhere 
than within the target locus, is termed ectopic gene 
targeting. Illegitimate junctions are depicted via 
double, horizontal dashed lines where recipient DNA is 

30 joined to chromosomal DNA in the absence of homology. 

Fig. 2 illustrates southern analysis of mother 
clones Al, A6 and A14 and selected daughter clones. 
Genomic DNA was isolated and digested before 
electrophoresis and capillary transfer to nylon 

35 membranes as described in materials and methods. DNA 
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was digested with either BamHI alone (A, B and C) or in 
combination with Ndel (see D) and blots were probed 
with a neo probe that lacked the promoter sequences 



5 digested lambda DNA was used as a marker of molecular 
weight and is shown in lane ^- Lane A contains the 
digestion profiles of mother clone Al and five daughter 
clones- Two specific bands, appear for the mother clone 
Al (see A, lane Al and E, Al ) . The two bands indicate 
10 that two target vectors are arranged head to tail in 
which the 5,6 kb band represents an intervector band 
and the 8 kb band spans the junction of one vector with 
genomic sequences (E r Al ) . These bands are maintained 
in 4 daughter clones (A1.2, Al . 3 , Al.5 and Al . 9 ) 

15 indicating the target locus has remained intact in 
these clones. A1.12 is an exception in which the 
junction band has increased in size to approximately 
8.2 kb suggesting reconstruction of the full neo gene. 
Digestion with Ndel in combination with BamHI should 

20 produce a 1.1 kb band for the target locus or a 1.3 kb 
band for a corrected neo gene (D). In the case of 
Al . 2 and Al.5 the target locus remains uncorrected 
(i.e. maintenance of 1.1 bk band) while the recipient 
is corrected producing a 1.3 kb band. Clone A1.12, on 

25 the other hand, exhibits only a 1.3 kb band indicating 
that the target locus has been corrected- Such a 
pattern is indicative of a crossover event in A1.12 
which has replaced downstream sequences from the neo 
gene of one vector (including the neo gene and lambda 

3 0 sequences of both target vectors) with sequences from 
the recipient (E, A1.12). In B the digestion profiles 
of A6 and five daughter clones are shown. Lane A6 
shows a single target band of 5.0 kb which is 
maintained in all five daughter clones (lane A6.2 

3 5 through to A6-6) and represents a single integration of 



(Eag I/HincII fragment of pMClneopA ) . 
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the target locus (E, A6). In C the digestion profiles 
of A14 and five daughter clones are shown. A single 
3.8 kb band appears for A14 which indicates a single 
integration of the target locus in these clones (E, 
5 A14). This band is maintained in all daughter clones 
except for A14.4 in which the target band is shifted 
slightly to approximately 4.0 kb (C, A14.4). Again 
this would indicate that the target locus had been 
corrected. Indeed, double digestion with Ndel and 
10 BamHI gives the same profile as seen for A1.12. All 
other bands represent integrations of the recipient 
vector . 

Pig. 3 illustrates an arrangement of vector 
sequences in the genome, as well as restriction enzyme 
15 sites in accordance with the method of the present 
invention . 

Fig. 4 illustrates a FISH analysis of mother 
clones Al, A6 and A14 and selected daughter clones. 
Cell cultures were prepared for In situ hybridization 

20 and FISH analysis was carried out. DNA was 

counterstained with either propidium iodine (red, A, F 
and I) or DAPI (blue, B-E, G, H, J and K). Target 
sequences appear as green signals on a blue background 
or yellow on a red background. Recipient sequences 

25 appear as red signals against a blue background. 
Frame A shows a full complement of chromosomes from the 
mother clone Al in which two copies of the target 
vector have been integrated into a dicentric chromosome 
between two centromeres (white arrow). Daughter clones 

30 A1.2 (B) and Al . 5 (C) have integrated the recipient 
(small arrow) in close proximity to the target locus 
(large white arrow). Both the green and red signals 
can be seen separately rather than as a single white or 
yellow dot, suggesting the recipient and target 

35 sequences are more than 100 kb but not more than 2-3 Mb 
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apart (Trask, B. J., Trend. Genet . 7:149-154, 1991). In 
clone Al . 9 (D) the recipient sequences have integrated 
in the midarm of an acrocentric chromosome (small 
arrow) while the target locus has remained intact in 



(E) shows a crossover event in which the target 
sequences have been completely replaced by the 
recipient sequences at the target locus (white arrow). 
Mother clone A6 is depicted in frame F in which a 

10 single target vector has integrated into small 
satellite arms of an acrocentric chromosome (white 
arrow). In both clone A6.2 and A6 . 3 the recipient 
sequences have integrated in the midarm of acrocentric 
chromosomes (see small arrows in G and H, respectively) 

15 leaving the target locus intact in the original 
acrocentric chromosome (large arrows). Frame I shows 
the single integration of a target vector in the mid 
arm of a metacentric chromosome in mother clone A14 
(white arrow). The recipient sequences (small arrow) 

20 have integrated into the same metacentric chromosome 
within 2-3 Mb of the target locus (large arrow) in 
clone A14.4 (J). This is interesting as Southern 
analysis indicates a crossover event which should have 
resulted in an intermediate white signal (juxtaposition 

2 5 of recipient and target DNA) or loss all together of 

the green signal (i.e. target locus) as in Al.12 (E). 
Thus this clone may represent a rare event in which a 
crossover occured but the recipient was still able to 
integrate ectopically. Such a "broken arrow" suggests 

3 0 the commitment of both ends of the recipient molecule 

at the time of the recombination event rather than 
sequential participation of each end in recombination. 
Clone A14.6 is also shown in frame K, in which 
recipient sequences have integrated into the telomere 
3 5 of an acrocentric chromosome (small arrow) and the 



5 



the dicentric chromosome (large arrow). 



Clone Al.12 
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metacentric chromosome containing the target vector 
(large arrows) has been duplicated, most likely by a 
non disjunction event. 

Fig. 5 illustrates a histogram of the 
5 distribution of inter signal distances determined for 
target (green) and recipient (red) FISH signals in 
interphase nuclei- Photographic slides of nuclei were 
projected at distance of 3 m onto a screen and 
distances between red and green signals were measured 

10 in cm. The inter signal distances were pooled in bins 
of 0 to 12 cm (average diameter of an interphase 
nuclei) in 1 cm intervals. Fig. 5 depicts a histogram 
of the number of nuclei in each bin for random, 
unlinked sequences (grey bar; n=69; data from a two 

15 independent pools of ~220 clones), linked sequences 
(horizontal striped bar; n=63; from pooled data from 3 
independent clones) and for far ectopic events (white 
bar; n=247; pooled data from 9 independent clones). 
Bin 0 represents co-localization or near co- 

20 localization of red and green signals. All values have 
been normalized for 63 nuclei. Black bars represent 
the normal distribution expected from the mean (6.37 
cm) and standard deviation (3.27 cm) of the observed 
random, unlinked inter signal distances. The far 

2 5 ectopic events follow a normal distribution for bins 1- 
12 but the observed number of co-localizations seen 
(i.e. ~10 nuclei in bin 0) deviates significantly from 
the expected number (~1) with a p value of less than 
0.0001. (note: although data were pooled for several 

30 clones to increase the total number of nuclei observed, 
no significant deviations were seen between independent 
clones with regard to the distribution of inter signal 
distances ) . 

Fig. 6 illustrates a "Double Strand Break 
35 Proximity" model based on the ability of a double 
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strand break (DSB) to enhance recombination in the 



indicate the nuclear matrix/lamina and finer lines 



5 chromosomal DNA at the lamina delineates the end of one 



(double slanted lines on chromatin loops) occurs the 
cell cycle arrests and the DNA at the site of the break 
becomes associated with DNA repair proteins which may 

10 reside on the nuclear matrix as "repair factories". 
This association alters the accessibility of the domain 
of chromatin in which the DSB has occurred ( indicated 
by a circle of dashed lines) such that recombination 
over this domain is enhanced. An incoming linear DNA 

15 molecule can mimic a DSB and may therefore be targeted 
to repair factories at the nuclear matrix much like 
genomic DSB's. Accordingly, if linear recipient DNA 
(dark rectangle) is used by the cell to repair a 
genomic DSB in the same domain of accessibility as the 

20 target locus (white rectangle), one end invasion of the 
target locus and gene conversion would reconstruct a 
functional neo+ gene (Fig. 6A). Resolution of the 
event by unwinding would lead to two fluorescent spots 
adjacent to each other, the distance between them being 

2 5 within a local domain of chromatin accessibility of 

less than 2-3 Mb (i). Cross-over would lead to 
deletion of intervening sequences between the target 
locus and the site of integration of the recipient 
(with only one fluorescent spot for the recipient 

3 0 present). Alternately, far ectopic events would 

involve interaction of chromatin domains of two 
separate chromosomes (Fig. 6B). A linear recipient 
molecule used to repair a DSB on one chromosome would 
be able to interact with the target locus on a 
35 different chromosome if the domains containing the 



local domain in which it has occurred. 



Heavy lines 



indicate chromosomal DNA. 



Note that the break in 



chromosome and the beginning of another. 



When a DSB 
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target locus and DSB are in close proximity to each 
other (most likely by association with the nuclear 
matrix). One end invasion and gene conversion at the 
target locus would lead to reconstruction of the neo+ 
5 gene. Release and unwinding of recipient would leave 
the target locus and chromosome unrearranged with the 
recipient integrating in the chromosome with the DSB 
(ii). Resolution by cross-over with the target locus 
would lead to translocation between the two 
10 chromosomes. 

The present invention will be more readily un- 
derstood by referring to the following examples, which 
are given to illustrate the invention rather than to 
limit its scope. 

15 

EXAMPLE I 
Ectopic Gene Targeting 

Vector Construction and Preparation 

20 Plasmid pAl059XTk (target) and pBll5AdHyg 

(recipient) were derived from pMClneopA ( Stratagene ) . 
Briefly, 3' deletion of NgOml/BamHI fragment or 5' 
deletion of Eagl/Ndel fragment of the neomycin 
resistance gene, respectively, was followed by 

25 introduction of Not I restriction site in the Ndel or 
Aat II restriction site (to allow linearization of the 
vector). Next a herpes simplex virus thymidine kinase 
(HSV-tk) cassette (from pAGO) and a hygromycin (hyg) 
cassette (from p3*SS, Stratagene), respectively, were 

30 cloned into the multicloning site 3' of the truncated 
neomycin gene. Finally, 16 Kb X virus sequence 
(GibcoBRL) or adeno viral sequence (GibcoBRL), lacking 
a Not I restriction site, was cloned into the vectors 
between the the neomycin and HSV-tk or hygromycin 

35 cassette, respectively. Due to the large size of the 
vector, subsequent subcloning to prepare transfection 
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quantities of DNA was carried out using SURE cells 
(Stratagene) and normal alkaline lysis miniprep 
followed by G-50 column was used to purify the DNA. 
Vector DNA was linearized by Not I and subjected to 
5 phenol chlorophorm extraction and ethanol precipitation 
before being resuspended in 1 X TE for storage at 
-20°C - 

Cell Culture and Transfection of LTA murine fibroblasts 

LTA murine fibroblasts (Tk-, Aprt-) were 

10 cultured at 37 °C, 5% C02 in complete medium (DMEM-F12 
medium supplemented with 10% fetal bovine serum ( FBS ) ) . 
Cells were split the day before transfection and plated 
at either 5 x 10 5 - 1 x 10 6 cells (CaP04> or at ~60% 
confluence (electroporation) . Mother cells lines were 

15 produced by electroporation of LTA cells. Briefly, 
cells were trypsinized and concentrated by 
centrif ugation, the cells were then resuspended in 1 ml 
of complete medium at room temperature. Upon counting, 
cells were diluted to 2.5 to 5 x 10 6 cells/ml with 

20 complete medium and 400 ul of cell suspension was 
electroporated (300 volts, 900 uF) using a gene Zapper 
450/2500 apparatus (IBI) in the presence of 1-2 ug of 
target plasmid linearized at Not I. Cells were 

selected for integration of the plasmid in HAT medium. 

25 Positive clones were subcloned using glass cloning 
rings and expanded in culture for no more than 5 
passages before being stored in liquid nitrogen. 
Genomic DNA was digested with restriction endonucleases 
and Southern analysis was carried out to determine the 

30 number of integrations- Cells exhibiting a simple 
hybridization pattern of integrated plasmid were then 
subjected to CaP04 transfection (Current Protocols, 
John Wiley Inc. USA, p. 9.1.4-9.1.9, 1996). Cells were 
fed 2 hours prior to transfection with 10 ml of fresh 

35 medium. Approximately 10 ug of DNA was coprecipitated 
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with CaP04 and the precipitate was left on the cells 
for 4 hours followed by 3 min DMSO shock (10% DMSO in 
complete medium) or 16 hours without shock. Cells were 
washed with PBS twice and fed with 10 ml of complete 
5 medium. Thirty six hours after transfection 400 ug/ml 
of G418 (GibcoBRL) was added to the medium. G418 
resistant colonies were picked as described for HAT 
resistance above. In addition, colonies were also 
subjected to hygromycin (GibcoBRL) at 250 ug/ml to 

10 determine resistance to the antibiotic - 

Harvest of Cells for Fluorescent In situ Hybridization 

LTA fibroblasts were grown to near 95% 
confluence before being trypsinized and replated at h 
to % original confluence. Depending on the growth 

15 characterisitics of each clone, harvest of cells began 
at 20 to 22 hours after trypsinization by the addition 
of 2 drops of colchemid per 5 ml. Cells were incubated 
at 37 °C, 5% CO2 for 2 hours in the presence of 
colchemid , after which the cells were trypsinized and 

20 collected by centrif ugation in 15 ml falcon tubes. 
Cells were then subjected to hypotonic shock by the 
addition of 10 ml of KC1 (0.07 M # Sigma) for 20 minutes 
at 37 °C. Cells were then centrif uged again before 
fixation in 10 ml of ice cold Carnoy I (3 part MeOH/ 1 

25 part acetic acid) . Fixation was repeated 3-4 times and 
cells were dropped on frozen slides (Fisher). Slides 
were cured for 24 hours at room temperature before 
being frozen at -20 °C. 
Southern Analysis 

3 0 Genomic DNA was prepared as described above and 

digested with restriction endonucleases . Digested DNA 
was electrophoresed on 0.7% agarose gel (Agarose-NA, 
Pharmacia Biotech) and transferred to nylon membranes 
(Hybond N, Amersham) . Hybridization was carried out 

3 5 with radiolabelled probe in 0.5 M sodium phosphate, pH 
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7-2, 7% SDS and 1 inM EDTA for 16-24 hours at 65 °C. The 
blot was washed with several changes of 4 0 mM sodium 
phosphate, pH 7.2, 0.1% SDS at 65 °C, and 
autoradiographed at -8 0°C for 3 to 7 days. 
5 Fluorescent In situ Hybridization 

FISH analysis was carried out as previously 
described (Lemieux, N. et al. , Cytogenet. Cell Genet. 
59:311-312, 1994). Briefly, after Rnase treatment, 
chromosomes were denatured in 70% formamide in 2 X SSC 
10 at 7 0°C for 2.5 minutes. Hybridization was performed 
overnight at 37 °C in 50% formamide, 10% dextran 
sulphate, 2X SSC, 0.1% sodium dodecyl sulphate, IX 
denhardt (0.02% polyvinylpyrrolidone , 0.02% Ficoll, 
0.02% BSA, pH 7), and 1 mg/ml denatured sonicated 
15 salmon sperm DNA. Probes were denatured for 10 minutes 
at 9 5 °C in the same medium. Probe concentration was 4- 
5 ng/ul of dig-labeled adeno-2 viral DNA and/or 5- 
6ng/ul of biotin labeled X virus DNA in a volume of 
20 ul per slide. Rinses were performed at 37 °C for 2 
20 minutes, twice in 50% formamide in 2X SSC followed by 
twice in 2X SSC. In addition high stringency washes 
were performed at 42°C for 15 minutes once in 50% 
formamide in 2X SSC followed by a single 2X SSC rinse 
for 8 minutes at 37 °C. Probes were generated from 
25 using the Bionick Labelling System (GibcoBRL) for 
biotin labeled probe or the Nick translation Kit (with 
addition of dig-ll-dUTP, Boehringer Mannheim) for 
digoxigenin labelling . 

Fluorescent Detection and Image Acquisition 

30 After hybridization, the slides were incubated 

45 minutes at 37 °C with rabbit antibiotin (Enzo), 
4.6 ug/ml, in PBT (PBS: 0.2 N NaH2P04, 0.2 Na2HP04, 
0.15 M NaCl pH 7.3, containing 0.15% BSA and 0.1% Tween 
20). After 2 rinses in PBT at room temperature for 5 

35 minutes, incubation was continued for 45 minutes in the 
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presence of 10 ug/ml of biotinylated anti-rabbit goat 
antibody (GibcoBRL). Again after incubation slides 
were rinsed again as described before addition of 
f luorescein-streptavidin conjugate (GibcoBRL) at 8 
5 ug/ml for final incubation of 4 5 minutes. During 
double detection the incubations continue with anti-dig 
mouse antibodies (1 ug/ml), anti-mouse-sheep Fab 
fragment (14 ug/ml) and anti-dig-sheep-rhodamine 
antibody (20 ug/ml) (Boehringer Mannheim). 

10 Counterstaining of DNA by propidium iodide (as 
described in Lemieux, N. et al. , Cytogenet . Cell Genet. 
59:311-312, 1994) or DAPI or was carried out before 
visualizing slides in presence of 10-15 ul of antifade 
solution. Antifade solution contained p-phenylene- 

15 diamine (PPD, Sigma, USA), 1 mg/ml of a mixture of 
glycerol: PBS 9:1 (v:v) adjusted to pH 9 with NaOH. 
Slides were visualized on a fluorescence microscope 
(Aristopan, Leitz ) without a signal amplification 
system. Red, blue and green fluorescence was observed 

20 by viewing through a triple band-pass filter (Omega 
Optical Inc., Vermont, USA). Images were captured 
using a charge coupled device (CCD) camera (Xybion 
Electronic Systems) and MacProbe version 2.5 (PSI) on a 
Quadra 840av Macintosh™ computer. Color balance 

2 5 adjustments and file conversion were accomplished in 

Adobe Photoshop™ V2.5.1. Although images were captured 
electronically for publication, signals could be easily 
seen through the microscope and slide film was taken to 
attest this fact. 

3 0 Results 

One of the approaches of the present invention 
was to use ectopic gene targeting to analyse chromatin 
accessibility and DNA interaction in vivo. To do so, 
Fluorescent In Situ Hybridization (FISH) analysis was 
3 5 used to determine the integration pattern of an 
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exogenous vector in three distinct cell lines each 
containing a target vector. These three cell lines 
were derived by electroporation of the murine 
fibroblast cell line LTA (tk- aprt~) with the target 
5 vector A1059A/tk in a linear form (Fig. 1A) . The target 
vector contains 16 kb of lambda sequence (used for FISH 
analysis) flanked by a 3 1 truncated neo gene (used for 
gene targeting) and a HSV-tk gene which was used for 
selection. Three tk+ cell lines were chosen and 

10 designated as A6 and A14 ., containing a single target 
vector, and Al which contains two copies of the target 
vector (arranged head to tail). Each of the three 
lines was then subjected to CaP04 transfection with a 
recipient vector, B115Adhyg (Fig. 1A), containing a 5' 

15 truncated neo cassette with 600 bp of perfect homology 
with the 3 ■ truncated neomycin gene of the target 
plasmid. As in the target vector the recipient vector 
contains unique sequences for FISH analysis (16 kb 
adeno-2 DNA) and a selection gene (hygromycin ) - 

20 Homologous recombination between the overlapping neo 
sequences in the two vectors will produce a functional 
neo gene which can be used for clonal selection in the 
presence of G418. The recipient vector was linearized 
via a Not I site, directly adjacent to the homology, to 

25 favor ectopic gene targeting events by leaving only one 
end of the recipient vector homologous to the target. 

The usual gene targeting process involves 
invasion of the target by two homologous ends of the 
exogenous vector. Ectopic gene targeting involves 

3 0 invasion of the target sequence by only one homologous 
end of the exogenous vector, which then primes DNA 
synthesis leading to gene conversion (see Fig. IB). At 
this point, there can be two outcomes. The recipient 
vector can form a homologous junction with the target 

3 5 at the target site while the other end of the recipient 
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vector forms an illegitimate junction at or near the 
target site (Fig. lB(i)). In this case the target site 
is modified. The other possibility is that the 

homologous end of the recipient vector is released and 
5 the recipient vector integrates elsewhere in the genome 
(Fig- lB(ii)). In this case the target site is 

unchanged. Thus, by determining where the released 
recipient molecule integrates, one can determine what 
other areas (domains) of the genome were accessible to 

10 the target site at the time of the recombination event. 
This was accomplished by FISH analysis using probes 
specific for the target and recipient vector, which 
enabled the direct analysis of the distribution of. 
integrated recipient DNA with respect to the target. 

15 Characterization of Mother Clones Al, A6 and A14 

Mother cell lines Al, A6 and A14 were subjected 
to Southern analysis to determine the copy number and 
structure of the integrated target sequences (see Fig. 
2). By probing against the neomycin resistance gene, a 

2 0 diagnostic band(s) was produced for the target locus. 

Two diagnostic bands are apparent for Al at 5.6 Kb 
(inter-vector band) and 8 Kb (junction band) indicating 
the integration of two target vectors in a head to tail 
configuration (Al in Figs. 2A and 3). A6 and A14 
25 exhibit a single band of 5 kb and 3.8 Kb, respectively, 
which indicates a single integrated copy of the target 
for each cell line (A6 in Figs. 2B and 3, A14 in Figs. 
2C and 3 ) . 

Mother cell lines Al, A6 and A14 were then 

3 0 subjected to single color FISH to determine the 

localization of the target vector. The location of the 
target locus in these cell lines is shown in Fig. 4. 
Clone Al, which has integrated 2 copies (arranged head 
to tail in tandem) of the target vector, contains the 
35 integrated sequences in a single site between the two 
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centromeres of a dicentric chromosome (Fig. 4A) . A6 
contains a single integration of the target vector in 
the short satellite arm of an acrocentric chromosome 
(Fig. 4F) and A14 contains a single integration of the 
5 target vector in the mid arm of a large metacentric 
chromosome (Fig. 41). 

These cell lines were then transfected with the 
recipient vector. Selection for ectopic and 

illegitimate integration events was carried out in 

10 medium supplemented with G418 or hygromycin 
(respectively). Resistant clones were counted and 
homologous (ectopic gene targeting) and illegitimate 
integration frequencies were determined. Illegitimate 
integration in the three cell lines was similar 

15 exhibiting a mean frequency of 5.6 x 10~ 3 . Although Al 
had twice the copy number of A6 or A14, homologous 
recombination rates for all three clones were quite 
similar with an average frequency of 2.0 x 10~ 6 . This 
agrees with previous reports that demonstrated that 

20 copy number does not affect homologous targeting 
frequencies significantly in mammalian cells. 
Ectopic gene targeting exhibits a bimodal distribution 

Cell lines Al, A6 and A14 were transfected with 
the recipient vector and targeted events (G418 R clones) 

25 were selected to be analysed by FISH. G418 R clones were 
expanded with no more than 5 passages before 
cryopreservation and genomic DNA extraction. In the 5 
daughter cell lines analyzed for Al all but one (see 
Fig. 2A, A1.12) contained the 2 characteristic bands 

30 representing the target locus (i.e. 5.6 Kb and 8.0 kb 
band). The loss of the 5.6 Kb target band and an 
apparent shift of the 8.0 kb band, coupled with 
sensitivity to HAT medium (i.e. tk-) and FISH analysis 
(Fig. 4E) indicate that daughter clone Al.12 contains a 

3 5 crossover event. This event most likely involved the 
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deletion of the 3 * region beyond the neomycin 
resistance gene of the first integrated copy of the 
target vector along with the entire second copy of the 
target vector. Replacing these sequences is the 3' 
5 region of the recipient containing the adeno-2 viral 
DNA and hygromycin gene (see Fig. 3, A1.12). 
Consequently, both the lambda viral DNA and tk genes of 
both copies of the target have been deleted resulting 
in loss of the green fluorescent signal representing 

10 the target locus, which is replaced by the red of the 
recipient (Fig. 4E) . It is more apparent in Fig. 2D 
that A1.12 contains a X-over event at the target locus, 
as the neo 1.1 kb band disappears and is replaced by a 
1.3 kb band representing a full neomycin resistance 

15 gene (clones A1.2 and Al . 5 which are ectopic events are 
shown for comparison ) . 

Overall, it is important to note that of greater 
than 30 G418 daughter clones analyzed by Southern for 
5 different mother clones, only 2 were scored for the 

20 loss of the target locus. The four other daughter 
clones of Al contain bona fide ectopic integrations 
(i.e. target locus is intact in each of them). Two of 
them, A1.2 and A1.5, have the recipient integrated in 
close proximity to the target (see Figs. 4B and 4C, 

25 respectively) and retain the 2 characteristic target 
bands as shown in Fig. 2A. As well, in Fig. 2D the 1.1 
kb band of target locus is present as expected for both 
A1.2 and A1.5- The two fluorescent signals can be 
resolved as two closely spaced but distinct spots at 

30 mitosis thus allowing for setting the limit on the 
distance between the target and recipient in these 
clones as being less than 2-3 Mb (Trask, B. J., Trend. 
Genet. 7:149-154, 1991). Clones Al.9 and A1.3, on the 
other hand, had integrated the recipient vector in a 

3 5 different chromosome than the target (mid arm of a 
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small acrocentric chromosome (Fig. 4D); and mid arm of 
a metacentric chromosome , respectively. 

All 5 of the daughter clones derived from A6, in 
contrast, were the result of ectopic integration that 
5 occurred in different chromosomes than the target. A 
single band of 5.0 Kb representing the target locus is 
maintained in all 5 daughter clones indicating an 
intact target locus (Fig. 2B). Two daughter clones of 
A6, A6.2 and A6.3, are shown in Figs. 4G and 4H 

10 (respectively). Daughter clones of A14 contain the 

single diagnostic band of 3.8 Kb, which indicates the 
target locus is intact in all 5 daughter clones. An 
intermediate distribution of ectopic events is seen 
with one clone exhibiting ectopic integration less than 

15 2-3 Mb from the target, A14.4 (see Fig. 4J), and 4 
others exhibiting integration of the recipient DNA 
molecule in other chromosomes than the one containing 
the target. The diagnostic band for the target locus 
is very intense and slightly shifted for A14 . 4 . This 

2 0 shift was suggestive of a crossover event and Southern 

analysis using an alternate digestion (as in Fig. 2D) 
indicated that the target locus had been converted 
( appearance of 1.3 Kb band and loss of l.lkb band) yet 
FISH analysis indicated both target and recipient 
25 sequences were present (i.e. as separate spots at 
mitosis, approximately 2-3 Mb from each other; see 
Fig. 4J). A cross over would be expected to produce a 
single red spot as in A1.12 with loss of the green 
signal representing the target locus, or as an 

3 0 intermediate white color, which would indicate 

juxtaposed sequences of less than 100 Kb apart. 
Therefore, A14.4 most likely does not involve a 
crossover event and represents a rare ectopic event 
involving an as yet undefined mechanism. Fig. 4K shows 
3 5 colony A14.6, an example of one of the distant ectopic 
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integration events with an apparent duplication of the 
target chromosome {most likely explained by non 
disjunction ) . 

No insertion-type events were observed in our 
5 experiments. Such events were not likely since the 
recipient vector was linearized prior to transfection 
and only one end of the gene targeting vector is 
homologous to the target. 

Thus, it appears that there are two distinct 
10 types of ectopic integration events. Those events 
which are in close proximity (<3 Mb) to the target 
(close) and those that occur in other chromosomes which 
do not contain the target locus (far). Integration of 
the recipient vector on the same chromosome at 
15 distances greater than 3 Mb from the target was not 
seen. This certainly does not imply that such events 
would not occur but it does indicate that such 
integration events would not be more likely than ones 
on distinct chromosomes. 
2 0 Pooled FISH analysis of ectopic gene targeting events 

Pooled FISH analysis of an additional 24 G418 R 
clones, 12 clones for Al, 6 clones for both A6 and A14 
was carried out to discern trends in the preferences 
for certain chromosome morphologies for far 
25 integrations. In general, far ectopic integration 
events occurred in morphologically different 
chromosomes at multiple sites in the daughter clones of 
Al and A14 mother cell lines, whereas A6 contains a 
large number of ectopic integrations in the mid arm of 
30 acrocentric chromosomes. A summary of all 3 9 clones 
analysed by FISH analysis, singularly or in pool, is 
shown in Table 1. 
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Table 1 

Summary of G418 daughter clones analyzed by PISH a 





Aero 


Meta 




Total 
No. of b 
Clones 


X c 
over 


Same 
Chms 


Diff. 
Chms 


Total 


T 


c 


M 


Total 


T 


C 


M 


A1 


17 


2 


14 


3 


1 


0 


0 


1 


2 


1 


0 


1 


A6 


11 


0 


1 


10 j 


7 


0 


0 


7 


3 


1 


1 


1 


A14 


11 


1 


2 


9 


7 


1 


2 


4 


2 


1 


0 


1 


Totals 


39 


3 


17 


22 


15 


0 


2 


5 


7 


1 


1 


2 



5 Includes all daughter clones analyzed by pool or 
independently - 

b 5 
In the case of pooled clones: 10 cells were pooled 

for each clone and the resulting pool of cells was 

passaged twice before harvest for FISH analysis. 100 

mitosis were counted and scored for chromosome 

morphology and location of fluorescent signal (red) for 

recipient in relation to target locus (green). The 

resulting numbers of mitosis were converted to numbers 

of clones by dividing the number of each morphology 

type by 4.2 , as on average one would expect to see 4.2 

mitosis for each of the 24 clones in 100 mitosis. 

c Cross-overs where scored when no target (green) 
fluorescent signal was present but had been replaced at 
the target locus by the recipient fluorescent signal 
(red) . 

Abbreviations : Acro=acrocentr ic ; C=centromere ; 

chms=chromosomes ; Dicent=dicentric ; M=mid arm; 

Meta=metacentr ic ; T=telomere 



Al daughter clones showed a striking number of 
close ectopic events within 2-3 Mb of the target (14 
out of 17 clones), whereas A14 and A6 showed a much 
smaller number of close events (2 in 11 and 1 out of 
11, respectively). Two of the close events for Al were 
single cross-over events, and one event for A14 had an 
apparent cross-over coupled with ectopic integration, 
which brings the total number to 3 out of 39 clones 
(8%). In A6, 7 out of 11 clones (63.6%) were scored 
for the appearance of the recipient signal in the mid 
arm of an acrocentric chromosome . A14 did not show any 
bias for any one chromosome morphology or position and, 
due to the few number of far ectopic events, it was not 
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possible to discern a trend for such events for Al. 
Although superficially it would appear A6 has a bias 
for acrocentric/mid arm localization of ectopic 
integration, it must be mentioned that there are 3 
5 times more acrocentrics in the karyotype than 
metacentric and thus a frequency of ~64% does not 
indicate a statistically significant correlation for a 
specific chromosome morphology and position. What The 
analysis does indicate is that by using a pooling 

10 approach to FISH analysis one can produce a relatively 
large data set for analysing the distribution of 
ectopic gene targeting at a given locus. Our 
observations also lead us to suggest that far ectopic 
integrations can occur in more than one chromosome for 

15 a given locus. It remains to be determined if far 
events are random or if they occur in specific 
chromosomes . 

The distribution of illegitimate integration at 
megabase resolution is stochastic 

20 The large number of close events in Al vs. A6 or 

A14 raises a question of bias for integration in the 
dicentric chromosome. Southern or genetic analysis 
cannot indicate linkage over large distances on the 
same chromosome- Therefore, the distribution of 

25 illegitimate integration was analysed via PISH analysis 
since it gives a direct estimate of the distribution of 
illegitimate integration at megabase resolution. 

Approximately 220 or 250 hygromycin resistant 
(HYG+) clones were pooled for cell lines Al and A6, 

30 respectively, and subjected to two color FISH analysis. 
Approximately 3000 HYG+ clones were pooled for A14 to 
determine if the number of clones pooled for Al and A6 
could produce a representative distribution of 
illegitimate events. The distribution of illegitimate 

35 integration events in relation to the target locus was 
scored in 200 mitosis for relative position of the 
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fluorescent signal on a chromosome (centromeric, 
telomeric or mid arm) as well as the morphology of the 
chromosome (acrocentric, dicentric or metacentric) 
containing the signal (see Table 2). The distribution 
5 of illegitimate events for all three clones was 
similar, indicating that illegitimate integration in 
these clones can not account for any differences in the 
pattern of ectopic gene targeting. 

10 Table 2 

Distribution of Illegitimate Integration without 
Ectopic Gene Targeting 



Clone 

(pool 
size) 


No. of 
Mitosis 
(chms) 


No. of 
Aero 
chms 


No. Of 
Meta 
chms 


No. of 
Dicent 
chms 


Mid 
arm 


Telo 


Centro 


Signal 

In 
Same 
chms 


Double 
Signal 


Triple 
Signal 


A1 

(~220) 


200 
(205) 


97 


108 


0 


136 


43 


26 


2 


13 


1 


A6 

(-250) 


200 
(212) 


134 


76 


2 


154 


30 


26 


0 


13 


0 


A14 

(-3000) 


200 
(207) 


119 


80 


8 


132 


40 


27 


3 


12 


0 


Totafg 


600 
(624) 


350 
<56.1> 


264 
<42.3> 


10 

<1.6> 


432 
<69.2> 


113 
<18.1> 


79 
<1Z7> 


5 b 

<0.8> 


38 b 

<6.3> 


<0.2> fc 



a Percentage of 624 chromosomes scored unless otherwise 
noted; 



Out of 600 mitosis scored 

20 

Abbreviations : Acro=acrocentric ; Centro=centromere ; 
chms=chromosomes ; Meta=metacentric ; Telo=telomere 



The distribution seen suggests that illegitimate 
integration in the presence of a chromosomal target is 
not biased, on a megabase scale, for any given 

25 chromosome or chromosomal position (i.e. telomere, 
centromere, mid arm) including the target chromosome 
and locus, respectively- The distribution reflects the 
prevalence of each chromosomal position and chromosome 
morphology. For example, there are -18% telomeric 

30 signals on average and ~13% centromeric, or roughly two 
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to one, and ~69% mid arm. This agrees well with the 
majority of chromosomal DNA being scored as mid arm and 
a count of one centromere and two telomeres per 
chromosome. All chromosome morphologies are 

5 represented roughly according to the proportion found 
in the karyotype. The karyotype is modal at a mean of 
52 chromosomes and has on average 13 metacentric (or 
submetacentric), 37 acrocentric, 1 dicentric and 1 dot 
like chromosome (see Fig. 4A). Thus the chance of 

10 targeting any specific chromosome would be 
approximately 1 in 52 (~2%). Targeting the same 2 Mb 
unit of chromatin considering approximately 6000 such 
units in the average diploid mammalian genome would be 
2 in 6000 (~0.03%). The chance of targeting an 

15 acrocentric is ~56% and for a metacentric is ~42% which 
is slightly skewed from the theoretical values of 71% 
and 25% (respectively) towards metacentrics. One 
explanation for this may be that metacentrics contain 
more DNA in general (see size difference in Fig. 4A) 

20 than acrocentrics. Considering that in normal murine 
karyotype all 4 0 chromosomes are acrocentric and 
similar in size it would follow that each metacentric 
(composed of essentially two acrocentric chromosomes 
after Robertsonian fusion) contains approximately twice 

25 the amount of DNA of a single acrocentric. Therefore 
each metacentric may provide more potential sites for 
integration to occur. In fact when considering this, 
one can create a theoretical karyotype of 65 
chromosomes (i.e. 37 acrocentrics, 26 metacentric 

3 0 derived acrocentrics, one dicentric and one dot like 
chromosome). Using this karyotype the frequency for 
targeting a true acrocentric becomes ~57% and targeting 
a metacentric (i.e. actually two acrocentrics) is ~40%. 
These frequencies are very close to the actual observed 
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frequencies in our analysis for transgene integration 
in acrocentric and metacentric chromosomes . 

The dicentric chromosome was a target for 
illegitimate integration 1.6% of the time which agrees 
5 well with the theoretical value of 2%. Integration in 
the same chromosome as the target in about 1% of 
mitosis scored is noted. The majority of these were 
very close to the target and most likely targeted 
events as they appear at the same frequency as expected 

10 for ectopic gene targeting events. 

Interestingly, two separate signals for the 
recipient vector were seen in ~6% of the 600 mitosis 
observed for the three pools. This would suggest that 
in a given transf ection the incoming DNA will integrate 

15 in one locus in the majority of cells (>90%). The 
integrated DNA may be in tandem as well as physically 
separated by as much as 100 kb, as at interphase 
separate fluorescent signals could be discerned for 
loci which at metaphase appeared as a single intense 

20 spot. One triple signal was seen, but this was a rare 
event at ~0.2%, and is most likely a double event 
coupled with non disjunction rather than three separate 
integrations. In toto, the data suggests that 

illegitimate integration in mammalian cells occurs 

25 primarily at a single site, stochastically, showing no 
bias for single chromosomes or chromosomal locations at 
megabase resolution . 

Interphase analysis of far ectopic events 

Interphase FISH analysis was conducted for 9 
30 clones (2 derived from Al, 4 from A6 and 3 from A14 ) 
which contained far ectopic integration events on 
distinct chromosomes from the target. For each of the 
9 clones distances between target (green) and recipient 
(red) sequences were measured to determine the 
35 frequency of co-localization of signals. Similar 
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measurements were made for pools of clones containing 
illegitimate integrations of the recipient, derived 
from cell lines Al and A6, to provide a random or 
"unlinked" distribution for co-localization of red and 
5 green signals. In addition, two clones from Al and one 
clone from A14, containing close ectopic integrations, 
were used as a "linked" control for co-localization of 
FISH signals . 

In a total of 247 nuclei observed for the 9 far 

10 ectopic clones, 3 5 nuclei (~14%) exhibited coincident 
or nearly coincident red and green signals. In 
contrast, a pool of ~440 clones containing random 
illegitimate integration events (unlinked loci) did not 
exhibit co-localization of red and green signals in 69 

15 nuclei observed. The close (linked) control exhibited 
63 coincident or nearly coincident FISH signals in all 
63 nuclei observed. The distribution of inter signal 
distances for both the random (unlinked) control and 
the pool of far ectopic events approximated a normal 

20 distribution with the exception of a significant 
deviation (p < 0.0001) for the number of co- 
localizations for the far ectopic events (see bin 0, 
Fig. 5). These results suggest that during interphase 
the site of integration of the recipient vector in far 

25 ectopic clones is found in close proximity to the 
target locus in a significant number of nuclei. 
Discussion 

Ectopic gene targeting exhibits a bimodal 
distribution in murine fibroblasts- The recipient DNA 

3 0 molecule may integrate nonrandomly within 3 Mb of the 
target or may integrate in other chromosomes, perhaps 
randomly. If indeed far events are random then 

integration in the target chromosome at distances 
beyond 3 Mb from the target locus may also occur. In 

3 5 our study, no far ectopic integrations where seen on 
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the same chromosome as the target locus, nonetheless, 
such events cannot statistically be excluded. A 
bimodal distribution was observed for 3 9 ectopic gene 
targeting events analysed for 3 independent target loci 
5 (either separately or in pools). In contrast, 

illegitimate integration of the recipient vector was 
shown to be random at the resolution of our FISH 
analysis. Our experiments were carried out on 

unsynchronized cell populations such that the effects 

10 of the cell cycle can not be addressed directly. 
Neverthless, the results were highly reproducible 
between independent experiments. 

The results obtained in accordance with the 
present invention, lead us to suggest a model for 

15 genomic domain interactions that takes into account the 
observed bimodal distribution of ectopic gene 
targeting. FISH analysis of the integration pattern of 
illegitimate events indicates that at a resolution of 
10-20 Mb there is no evident bias with respect to 

2 0 chromosomal location. In other words, integration 
appears random. This is the case despite the presence 
of the homologous genomic target. Thus, it appears 
that homology per se does not act as a determining 
factor in the localization of the integration site. 

25 Yet when a double strand break is introduced in genomic 
homologous sequences, integration occurs highly 
preferentially at the site of the break by homologous 
recombination (Rouet, P. et al . , Mol . Cel. Biol. 
14 : 8096-8106, 1994). From these observation it would 

30 seem that the location of the integration site is first 
and foremost determined by the occurrence of a double 
strand break in genomic DNA and that if this DSB occurs 
at or near homologous sequences then the integration 
will most likely involve homologous recombination. 
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If natural DSBs occur randomly in the genome , 

then a DSB in any given 2 megabase unit should occur at 

— 4 

a frequency of about 3.3 x 10 considering there are 
9 

6 X 10 bases in the diploid murine genome. Since the 
5 overall frequency of illegitimate recombination for our 
cell lines is 6 X 10 ^, then the frequency of 
integration at a DSB within a 2 megabase domain 
containing the target should be the product of the two 
(3.3 x 10" 4 multiplied by 6 X 10~ 3 ), thus 2 X 10~ 5 . 

10 Interestingly, this is the frequency at which ectopic 
gene targeting is noted which is also similar to the 
frequency of traditional gene targeting. This 
similarity in frequencies has been observed previously 
by other groups. These frequencies are in agreement 

15 with integration involving first a DSB break which, if 
it occurs at or near a homologous target, will result 
in gene targeting. Thus, this could explain the 
ectopic gene targeting events that occur in close 
proximity to the target. According to the above 

20 reasoning, this would mean that the DSB occurred in a 
domain that, although situated in a distinct 
chromosome, was in close proximity to the domain 
containing the homologous target at the time of the 
ectopic gene targeting event. In support of a close 

25 association of far ectopic sites to the target locus at 
the time of recombination, a significant number of co- 
localizing recipient and target signals were observed 
by FISH in interphase nuclei of 9 separate far ectopic 
clones produced from mother clones Al, A6 and A14. In 

30 contrast, no co-localizations were observed for pools 
of random illegitimate events which exhibited a normal 
distibution of inter signal distances between recipient 
and target sequences at interphase. It is tantalizing 
to speculate that such domain associations may be 

35 occurring in a cell cycle specific manner linked with 
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either replication or transcription. A "Double Strand 
Break Proximity" model, presented in Fig. 6, summarizes 
this hypothesis. 

An obvious alternative to this model would be to 
5 invoke that gene conversion between the target and the 
exogenous vector occurs first, followed by release of 
the vector and random integration in the genome via 
illegitimate recombination. The frequencies for each 
of these events do not support this hypothesis. Gene 

10 conversion between an exogenous vector and a genomic 
homologous target has been measured and were found to 
be <10 6 . Since the frequency of illegitimate 

integration in our assay is 6 X 10 3 , then the 
frequency of ectopic gene targeting, if it occurred in 

15 these two successive steps, should be the product of 

_ o _ q 

the frequencies of each step or 10 to 10 . This is 
at least two orders of magnitude lower than what is 
seen for the frequency of ectopic gene targeting. Of 
course it may be that, after the gene conversion step, 
20 the exogenous vector becomes highly potentiated for 
integration. Taking this possibility into account, 
still the site of integration should be near the target 
site as depicted in Fig. 5, to explain the observed 
bimodal distribution which is therefore not random. 

2 5 In support for the concept that integration is 

driven by a DSB, It has been observed that illegitimate 
integration of an exogenous vector occurs at only one 
site, greater than 90% of the time (Folger, K. R. et 
al., Mol. Cell. Biol. 2:1372-1387, 1982; Richard, M. et 
30 al., Mol. Cell. Biol. 14:6689-6695, 1994). 
Furthermore, when there is a gene targeting event, 
rarely is there also in the same cell a separate 
illegitimate integration event even though the later 
occurs usually a thousand times more frequently than 

3 5 the former. Thus, this suggests that in most of the 



WO 98/55610 




- 40 - 




7CA98/00532 



cells where an integration event (homologous or 
illegitimate) occurs, there is only one genomic site 
available for integration. Recently, it has been shown 
that as few as one double strand break can cause p53- 
5 dependent cell cycle arrest in human embryonic 
fibroblasts (Huang, L. C. et al . , Proc. Natl. Acad. 
Sci. USA 93:4827-4832, 1996). CaP04 treatment of cells 
alone can also induce p53 cell cycle arrest (Renzing, 
J., et al., Oncogene 10x1865-1868, 1995). Cell cycle 

10 arrest triggered by CaP04 and/or DNA damage may 
therefore provide a means of limiting the number of 
DSBs that can accumulate during a given cell cycle, 
thus limiting the potential number of sites of 
integration of exogenous DNA. 

15 The number and types of close (targeted or 

crossover) events and far ectopic gene targeting events 
were distinct for all three loci . These trends were 
even more evident upon pooled analysis of an additional 
24 G418 clones from Al (12 clones), A6 (6 clones) and 

20 A14 (6 clones). Clone Al . had the highest number of 
close events and the least far ectopic integrations, 
followed by A14 and then A6 (with the most far events 
and only one close event). Only clone A6 showed a 
strong preference for a specific chromosome location 

25 and morphology, where "-64% of ectopic integrations 
occurred in the mid arm of acrocentric chromosomes . Al 
and A14 showed weaker trends for specific chromosomes 
but this may only be due to the reduced number of 
distant ectopic integrations seen for these clones. 

30 Since the distribution of close and far events differs 
between loci this suggests a site specific effect on 
ectopic gene targeting. 



is organized in a coherent manner such that gene 
35 sequences may be accessed at certain points in the cell 



It is apparent that chromatin within the nucleus 
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cycle for replication and transcription. 

Compartmentalization seems to occur for these processes 
(ex. transcription factories (Cook, P. R., J. Cell Sci . 
108:2927-2935, 1995); replication factories (Jackson, 
5 D. A., Bloessays 17:587-591, 1990)), the factors they 
require (ex. splicing factors (Spector, D.L. et al., 
Cold Spring Harbor Symp>. Quant. Biol. 58:799-805, 
1993)) and the chromatin involved (chromosome 
territories (Cremer T. et al., Cold Spring Harbor Symp. 

10 Quant. Biol. 58:777-792, 1993)). The transcription and 
replication factories may be able to organize DNA 
domains from the same chromosome or from distinct 
chromosomes in such a way that accessibility of one 
domain to another may be enhanced- Using this line of 

15 reasoning, it may not be at all surprising that domains 
on distinct chromosomes have access to each other. 

Ectopic gene conversion occurs naturally among 
non allelic sequences in many organisms and the 
mechanism of ectopic gene targeting also seems to be 

20 conserved across phila. In the present Application, it 
has been demonstrated that ectopic gene targeting 
exhibits a bimodal distribution of integration in 
murine cells. This indicates that both intra and 
interchromosomal sites are accessible to the targeting 

25 vector. Thus, the RAM method in accordance with the 
present invention may by used to analyse ectopic gene 
targeting and to determine which chromosomal domains 
within the genome are accessible to a given genetic 
locus . 

3 0 While the invention has been described in con- 

nection with specific embodiments thereof, it will be 
understood that it is capable of further modifications 
and this application is intended to cover any varia- 
tions, uses, or adaptations of the invention following, 

35 in general, the principles of the invention and 
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including such departures from the present disclosure 
as come within known or customary practice within the 
art to which the invention pertains and as may be 
applied to the essential features hereinbefore set 
5 forth, and as follows in the scope of the appended 
claims . 
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WHAT IS CLAIMED IS : 

1. A method to identify regions of chromosomes that 
interact functionally within or between chromosomes 
which comprises the steps of: 

a) recombining a linear recipient vector with a 
target sequence present in at least one 
chromosome, wherein said recipient vector 
comprises at least a recipient sequence 
consisting of a nucleic acid sequence 
complementary to recombine with said target 
sequence and a tail consisting of a heterologous 
nucleic acid sequence; and 

b) identifying at least one region of the 
chromosome capable of recombining with the 
recipient vector by PCR means or by detecting a 
label and/or a tag on the recipient vector or 
the target sequence. 

2. The method of claim 1, further comprising the 
step of : 

c) characterizing the region identified in step b) , 
thereby mapping said region on a chromosome or a 
segment thereof. 

3. The method of claim 1, wherein the target 
sequence is a modified vector comprising a genomic 
sequence and a reporter gene or fragment thereof, 
allowing for the identification of said target 
sequence. 

4. The method of claim 3, wherein the genomic 
sequence is a gene or fragment thereof. 
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5. 



The method of claim 1, wherein the target 



sequence is modified to introduce a first part of a 
reporter gene, and wherein the recipient vector 
comprises a second part of the reporter gene, such that 
recombination of the recipient vector with the target 
sequence allows the reporter gene to be functional, 
thereby allowing for the identification of said target 
sequence. 

6. The method of claim 1, wherein the recipient 
vector further comprises at least one fluorescence in 
situ hybridization signal (FISH) or one radioactive in 
situ hybridization signal allowing for detection of in 
situ hybridization of the recipient vector with thie 
target sequence. 

7. The method of claim 1, wherein the recipient 
vector further comprises a signal in situ hybridization 
for detection by electron microscope in situ 
hybridization of the recipient vector with the target 
sequence . 

8. The method of claim 1 / wherein the tail has a 
sequence of at least 1 Kb in length . 

9. The method of claim 1, wherein the tail has a 
sequence of about 10 Kb in length. 

10. The method of claim 1, wherein the recipient 
nucleic acid sequence is at least 300 bp in length. 



11. The method of claim 1, wherein the recipient 

nucleic acid sequence is at least 500 bp in length. 
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12. The method of claim 1, wherein the recipient 
nucleic acid sequence is about 7 00 bp in length. 

13. The method of claim 1, wherein the reporter gene 
is a selection gene. 

14. The method of claim 13, wherein the selection 
gene is selected from the group consisting of neomycin, 
puromycin, hygromycin and herpes simplex thymidine 
kinase . 

15. A method to identify regions of chromosomes that 
interact functionally within or between chromosomes 
which comprises the steps of: 

a) recombining a linear recipient vector with a 
target sequence present in at least one 
chromosome, wherein said recipient vector 
comprises at least a recipient sequence 
consisting of a nucleic acid sequence 
complementary to recombine with said target 
sequence ; 

b) identifying at least one region of the 
chromosome capable of recombining with the 
recipient vector by inverse polymerase chain 
reaction (PCR); and 

c) characterizing the region identified in step b), 
thereby mapping said region on a chromosome or a 
segment thereof. 

16. A method to identify regions of chromosomes that 

interact functionally within or between chromosomes 
which comprises the steps of: 

a) recombining a linear recipient vector with a 
genomic sequence, wherein the linear recipient 
vector comprises a recipient sequence of about 
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700 bp in length and a tail comprising a 
heterologous nucleic acid sequence of about 
10 Kb in length and at least one fluorescence in 
situ hybridization signal (FISH) allowing for 
detection of in situ hybridization of the linear 
recipient vector with the genomic sequence, said 
genomic sequence comprises a sequence of a gene 
present in at least one chromosome, the 
recipient sequence is a nucleic acid sequence 
complementary to recombine with said sequence of 
the gene and the genomic sequence is modified to 
introduce a first part of a neomycin gene, the 
recipient vector comprises a second part of the 
neomycin gene, such that recombination of the 
recipient vector with the genomic sequence 
allows the neomycin gene to be functional, 
thereby allowing for the identification of said 
genomic sequence; 

b) identifying a region of the chromosome to which 
the recipient vector is recombined by selecting 
surviving cells in neomycin-containing medium, 
thereby indicating cells containing recombined 
recipient vector with the genomic sequence 
allowing for the expression of the neomycin 
gene, and detecting at least one fluorescence in 
situ hybridization signal; 

c) characterizing the region identified in step b), 
thereby mapping said region on a chromosome or a 
segment thereof - 

17. A method to map the distribution of double 

strand breaks in a chromosomal DNA sequence, said 
method comprising the steps of: 

a) recombining a linear recipient vector with a 
chromosomal target sequence present in at least 
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one chromosome, wherein said recipient vector 
comprises at least a recipient sequence 
consisting of a nucleic acid sequence 
complementary to said target sequence and a 
specific DNA sequence for polymerase chain 
reaction (PCR) amplification or for fluorescent 
in situ hybridization (FISH) analysis and 
wherein at least the recipient sequence or the 
target sequence contains a recognition site for 
an enzyme or chemical means for inducing a 
unique and specific double strand break within 
said recipient or target sequences; 

b) identifying at least one region of the 
chromosome recombining with the recipient vector 
by inverse PCR or plasmid rescue; 

c) characterizing the size and position of a 
genomic domain containing the chromosomal target 
and recipient sequences by at least FISH 
analysis or pulse field gel electrophoresis, 
following the formation of specific double 
strand breaks within the chromosomal recipient 
and target sequences by enzymatic or chemical 
means; 

d) cloning at least in part the identified genomic 
domain of step c ) ; and 

e) DNA sequence analysis of the genomic domain 
cloned in step d) , thereby characterizing the 
genes and at least functional or structural 
sequence elements within the identified genomic 
domain. 

18. The method of claim 17, wherein the functional 

or structural elements are selected from the group 
consisting of origin of replication, matrix attachment 
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sites, transcription factor binding sites, imprinting 
centers and insulator elements. 
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